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Abstract 

Hierarchically modular systems show a sequence of scale separations in some 
functionality or property in addition to their hierarchical topology. Starting from 
regular, deterministic objects like the Vicsek snowflake or the deterministic scale free 
network by Ravasz et al., we first characterize the hierarchical modularity by the 
periodicity of some properties on a logarithmic scale indicating separation of scales. 
Then we introduce randomness by keeping the scale freeness and other important 
characteristics of the objects and monitor the changes in the modularity. In the pre- 
sented examples sufficient amount of randomness destroys hierarchical modularity. 
Our findings suggest that the experimentally observed hierarchical modularity in 
systems with algebraically decaying clustering coefficients indicates a limited level 
of randomness. 

Key words: Modularity, Fractals, Hierarchical networks, Average-linkage 
hierarchical cluster analysis 



1 INTRODUCTION 



During the last few years a great number of discoveries have formed our view of 
complex networks. It turned out that the scale- free property is an ubiquitous 
feature of most real networks, like the World Wide Web, metabolic networks, 
and collaboration networks, etc. [1,2]. The hierarchical topology of most of 
these networks has also been studied [3,4]. According to recent studies [5,6], a 
hierarchically modular organization lies beneath this topology: 'there are many 
highly integrated small modules which group into a few larger modules, which 
in turn can be integrated into even larger modules' [5]. The so-called hierar- 
chical network model was introduced [4] , which is a simple illustration of the 
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idea above. However, the modularity of real networks is still an open question. 
The problem of the unambiguous identification of modules or communities of 
a system at different hierarchical levels has not been solved ultimately, though 
important advances have been achieved [7] 1 . The average-linkage hierarchi- 
cal clustering algorithm [8] groups the points of a network according to the 
topological overlap between them [5]. Modularity is then attributed to the clus- 
tered structure of the overlaps and hierarchical modularity is obtained if the 
observed clusters can be interpreted such, i.e. if the clusters are hierarchically 
nested. The advantage of this method is that it avoids the delicate problem of 
a priori identification of modules. 

In Ref. [5] metabolic networks were studied and the interesting conclusion was 
drawn that complex networks show hierarchical modularity if they are charac- 
terized by a clustering coefficient C(k) decaying with a power law as a function 
of the degree k of the nodes (often with an exponent close to unity). The clus- 
tering coefficient is a measure of the inter-connectedness of the neighbors of a 
particular node [1,2,9] and it is clearly related to the community structure. In 
Ref. [5] this concept was nicely illustrated by a regular network and used to 
analyse the experimental data obtained for metabolic networks. 

In the present work we address the question of the effect of randomness on the 
hierarchical modularity. In order to do so we introduce a measure of modularity 
in scale free (hierarchical) systems without using identification of modules. We 
apply this concept to randomly rearranged regular fractals and networks. 



2 MODULARITY OF FRACTALS 

2.1 The Vicsek Snowflake 

Understanding the modularity of networks is rather difficult, because it is hid- 
den in the network's topology. Therefore, first we show the effect of modularity 
in the case of fractals. We investigated the modularity of randomly rearranged 
Vicsek snowflakes embedded into two dimensions. 

Our initial object was the well known fractal shown on the left of Fig. 1, which 
has regular self-similarity [10]. At the highest hierarchical level, it consists 
of five well-separated blocks, and each of these blocks contains five smaller 
blocks in the same fashion, and so on. In our case, this hierarchical struc- 



1 In the paper by Clauset et al. [7] a quantity called modularity is defined based 
on already identified modules. We want to avoid this a priori identification (which 
is often a hard task) and we use the term modularity in the every-day sense for 
systems consisting of well identifiable and separable composite units. 
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ture has a finite resolution, so it repeats itself until reaching an elementary 
block size or lower cutoff. The fractal's dimension is given by the formula 
of D = In 5 /In 3 « 1.465. Obviously, for the regular fractal, the blocks (which 
have a finer structure not resolved at this level) can be considered as mod- 
ules at each level of the hierarchy. As these modules are intact until the next, 
finer level of the hierarchy is reached, there is a clear sequence of separation 
of scales: We have hierarchical modularity. Our goal is to measure quantita- 
tively how this hierarchical modularity changes at different levels of random 
rearrangement. 



2.2 Random Rearrangement of the Vicsek Snowflake 



We generated the initial regular Vicsek snowflake by taking a 3 x 3 sized 
bit matrix evenly filled with ones. One bit corresponds to an elementary block 
of the fractal. Then, starting with the highest hierarchical level, our algorithm 
'cut out' the five largest blocks, by turning the corresponding bits to zero. 
After that, taking the next sublevel, the next twenty five sub-blocks were cut 
out from the five large blocks having been generated in the previous step. After 
TV cutting steps one gets the expected Vicsek snowflake with finite resolution. 
The random rearrangement algorithm is based on this cutting method above. 
We got a continuous set of randomly rearranged fractals with a randomness 
being controlled by the parameter p G [0, 1] in the following way: At each 
hierarchical level, we shifted all of the five blocks with probability p to the 
remaining free spaces in the larger block. More precisely, we tossed for each 
of the five blocks whether they would be shifted or not, then we cut out the 
non-shifted ones, and after that we randomly placed the remaining ones to 
the free spaces with uniform distribution. So, before all of the cutting steps 
our rearrangement algorithm made that draw described above. 
In this model p = corresponds to the original Vicsek snowflake, and p = 1 
generates a totally random fractal, in which all the five blocks take place in 
the nine rooms with uniform random distribution at all hierarchical levels. By 
visual inspection it is clear that the degree of hierarchical modularity decreases 
as p changes from to 1 (see Figs. 1-3), and in case p = 1, it should be 
vanished (at least on the average). Note that a p = 1 fractal is also made 
in a hierarchical manner, but the applied random rearrangement destroys its 
regular modular structure by mixing the modules together at every hierarchical 
level. 
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Figure 1. Randomly rearranged fractals of size N = 1; p = (left) and p = 0.2 (right) 
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Figure 2. Randomly rearranged fractals of size N = 1] p — 0.4 (left) and p = 0.6 
(right) 
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Figure 3. Randomly rearranged fractals of size N = 1] p — 0.8 (left) and p—\ (right) 



2.3 Measurement of Hierarchical Modularity 

In Ref. [11] several methods for measuring the dimension of regular fractals 
were compared. In order to quantify modularity, we apply the box counting 
method, where we cover growing regions of the fractal with a growing square 
window started from the centre, and at each step we count the mass (the 
number of elementary blocks) inside the window. By this means, one gets a 
mass function: the inside mass of the fractal as a function of the linear size of 
the window. As pointed out in Ref. [11], for regular fractals this mass function 
is not a straight line in a log-log plot: There are periodic deviations due to 
the above mentioned separation of scales, i.e., due to modularity. Therefore 
we consider the periodicity of these deviations from the straight lines as the 
measure of modularity. 

From our model, presented in Sec. 2.2, one gets different deviation functions 
for different values of p. In order to analyse these functions we applied Fast 
Fourier Transform (FFT) to them. 



2.4 Results 

In this subsection we discuss the results of the measurement of modularity 
described above in Sec. 2.3. 

The log-log plots of the mass deviation functions as a function of the window 
size are plotted in Fig. 4 for different values of parameter p. For non-zero /?, 
each curve is the average of 200 randomly rearranged fractals (with same p). 

Apparently, the mass deviation functions reflect the hierarchical modularity of 
the fractals of different p values. In the p = case, the function has an inher- 
ent structure corresponding to its high degree of modularity. This scale-free 
structure is more visible on the farther periods of the function, because there 
we have more points. This inherent structure of the curves vanishes as p —> 1. 
The maximum deviation also decreases as we increase /?, and the functions 
become smoother and smoother (apart from the basic high peaks with values 
of 0), indicating the vanishing modularity of the fractals. The 0- valued peaks 
correspond to powers of 3: when the linear size of the window reaches powers 
of 3, the window contains a whole sub- fractal with the exact dimension. This 
is the consequence of not mixing the elements in a continuous way. The devia- 
tion is non-positive for all values of /?, which can be explained by the geometry 
of the applied rectangular windowing method in our special case. 
The vanishing modularity could be better represented taking the FFT spec- 
trum of the functions in Fig. 4. We analysed the last three periods of the 
functions with FFT, where they have the finest inherent structure. The re- 



5 



t 1 1 1 1 1 1 1 1 1 1 1 r 




0,0 0,5 1,0 1,5 2,0 2,5 3,0 3,5 

log 10 (window length) 

Figure 4. The deviation functions for fractals N = 7 at different levels of random- 
ization. The value of p decreases by 0.2 from above (p = 1, red) to below (p = 0, 
black). In cases p ^ we averaged 200 randomly rearranged fractals. 



suits are plotted in Fig. 5. 



Because of the triadic organization of the fractals, the fundamental spatial 
frequency is / = 1/ log 3 « 2.096. This corresponds to the 0- valued peaks men- 
tioned above, therefore the first peaks in the FFT spectra are direct conse- 
quences of the geometry of our method. For the regular fractal (p = 0) one 
could notice the first, second and third harmonics which indicate the scale- free 
property of the deviation function. The first harmonics, which indicates the 
fine structure of the deviation functions, significantly decreases for increasing 
values of /?, and the second and third harmonics totally disappear, what verifies 
the decreasing degree of hierarchical modularity of the fractals for increasing 
rearrangement probability p. 
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Figure 5. The FFT spectrum of the deviation functions of Fig. 4. The value of p 
increases by 0.2 from above (p = 0, black) to below (p = 1, red). 



3 MODULARITY OF NETWORKS 



3. 1 The Randomization of the Hierarchical Network Model 



Our starting point was the deterministic, modular hierarchical network model 
of Ravasz and Barabasi [3] (see Fig. 6). Its main features correspond to the 
real networks [3]: the degree distribution is scale- free, the clustering coefficient 
is independent of the size, and follows a power-law as a function of degree: 
C(k) oc k~ a . In real networks, a is usually about 1. 

The regular model also has obviously hierarchically modular structure. As 
real grown networks are to some extent random, the question raises how the 
modularity is influenced by the randomness. In order study this problem, we 
used a link randomization procedure, earlier already applied to investigate 
the influence of randomness on synchronization [12]. In this model two links 
were chosen randomly, and one node of both links was exchanged between the 
two links. This process was executed M x p times, where M is the number of 
the links and p is the control parameter of the randomization. This method 
conserves the degree distribution, as it does not change the degree of any node. 
As p increases from to 1, the average clustering coefficient C first falls rapidly 
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Figure 6. The deterministic hierarchical network at different sizes [3] 
then becomes constant (see Fig. 7). 
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Figure 7. The clustering coefficient as a function of the randomization. Each point 
represents 200 networks of 3125 nodes. 



For p = 1, the degree dependence of C(k) has two regions. It seems that the 
very low degree {k < 10) nodes' behavior is significantly different from the 
rest. Due to this effect and to the crossover it causes, the asymptotics sets in 
rather late. However, it is clear that the distribution is very broad and it has 
possibly a power law tail (Fig. 8). Also due to the small k anomaly the average 
C decrease with increasing system size for the considered number of nodes. 
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Figure 8. Clustering as a function of degree. The slope of the dotted line is —0.913. 
From the average of 400 networks of 78125 nodes, p = 1. 

3.2 Topological Overlap Matrix 

To investigate the presence or absence of modules and hierarchical structure, 
we used the topological overlap matrix (TOM) of the network [5]. The ij 
element of this matrix, 7}y equals the number of mutual neighbours of the 
nodes i and j (plus 1 if i and j are connected), normalized by the minimum 
degree of i and j, so 7]y is between and 1. Thus, the i-th row and column 
represents the overlap of the node i with the other nodes of the network (7# 
is defined as 1). If there is an isolated module of densely interconnected nodes 
in the network, and rows/columns representing the nodes of the module are 
close to each other in the topological overlap matrix, the module appears in 
the matrix as a square centred on the diagonal with elements close to 1. Of 
course, to enable this interpretation, the right sequence of the nodes has to be 
determined, just according to their overlap values, as it is outlined in the next 
paragraph. The hierarchy of modules is represented by a system of smaller 
and smaller (and more and more cohesive) squares embedded into the larger 
squares (in which the overlap of the modules decreases with the module size). 
These features can be easily observed on the matrix of the deterministic model 
(Fig. 9). 

The sequence of the rows/columns representing the nodes in the matrix is es- 
sential to recognize the modules and hierarchy. The rows/columns concerning 
nodes with big overlap have to be next to each other in the TOM, forming 
squares centred on the diagonal. In order to get the right sequence, we slightly 
modified the average linkage clustering method [5,8]. The original algorithm 



9 



0-0,1 0,1-0,2 0,2-0,3 0,3-0,4 0,4-0,5 0,5-0,6 0,6-0,7 0,7-0,8 0,8-0,9 0,9-1 

Figure 9. Topological overlap matrix of the original deterministic hierarchical net- 
work. The number of nodes is 625. 

builds communities joining nodes into 'supernodes' (see Fig. ??: hierarchical 
tree, after a contraction the order in the new supernode has to be determined). 
In each step two nodes are joined, meanwhile the TOM decreases by one row 
and column. 

The basic steps of the algorithm: 

- First it finds the highest element in the TOM and contracts the two corre- 
sponding nodes into a supernode. 

- The corresponding rows and columns of the two original nodes in the matrix 
are contracted into 1 row and 1 column (matrix elements are averaged) , then 
the next round is started. 
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- This procedure is repeated until the TOM decreases into a 1 x 1 matrix 
(every node joined into one supernode). 

The previous steps describe the original algorithm. Because it was unable to 
reconstruct even the simple regular case shown on Fig. 9, some modifications 
were applied: 

- It is possible that the above algorithm finds more than one elements with 
the same high value in the same step. In this case the contraction resulting 
the smallest supernode is performed. If this quantity is also degenerated, 
then the choice is made at random. The goal of this modification is to make 
the growing of the supernodes 

- We are searching for clusters, not just pairs. Therefore the algorithm exam- 
ines that the selected contraction is good from the view of building a cluster 
(containing more than 2 nodes). If a better contraction is possible, that will 
be executed. 

This way a sequence of contractions emerges. Parallel with the above algorithm 
placing the contracted nodes next to each other, the 'clustered' sequence of 
the nodes appear. To make the above algorithm more clear, there is a small 
example: 
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Figure 10. On the left-hand side, the contraction process is shown for 6 nodes. On 
the right-hand side, the resulting dendogram is displayed. Below: the corresponding 
matrices at timestep t — 0(the original TOM), 1,2,3. The grey characters are out of 
interest. 
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Applying this algorithm, the TOM of the randomized deterministic network 
is visually interpretable (Fig. 11): modular organization is not recognizable 




0-0,1 0,1-0,2 0,2-0,3 0,3-0,4 0,4-0,5 0,5-0,6 0,6-0,7 0,7-0,3 0,3-0,9 0,9-1 



Figure 11. The topological overlap matrix of the randomized (p = 1) deterministic 
hierarchical network (625 nodes). 

with this clustering algorithm after the randomization process for p = 1 . This 
network is therefore scale free, it has a broad, probably power law C(k) distri- 
bution - without a modular structure. 



12 



We would like to make the vanishing of the regular modularity quantitatively 
accessible in a similar way as in the case of the fractals: First, the elements of 
the TOM are raised to the third power in order to weaken the influence of the 
many small elements (to 'make contrast'). Then the elements are projected 
(summed) perpendicular to the diagonal (note that it means 2N — 1 sums for 
di N xN matrix), and the result is Fourier- analysed (Fig. 12). A peak in the 




average of the amplitude-spectra indicates the presence of equal-sized mod- 
ules. As p increases from zero, the peaks indicating the presence of the 5-node 
and 25-node modules are decreasing to zero rapidly (see Fig. 13). 
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Figure 13. FFT peaks corresponding to the 5- (a) and 25-node (b) clusters of the 
deterministic, hierarchical network as functions of randomization, averages over 500 
networks of size 625 (squares), 125 (circles), 25 (triangles). The amplitudes are 
normalized by the value corresponding to p = 0. 

4 DISCUSSION 



In this paper we analysed the effect of randomness on the hierarchical mod- 
ularity of scale free structures. The quantitative analysis was based on an 
important feature of modularity: the separation of scales. We studied regular 
structures (Vicsek snowflake, deterministic modular network) where random- 
ness was introduced such that scale freeness and other features (broad dis- 
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tribution of C{k)) were maintained. Appropriate characteristics of modularity 
were chosen using Fourier components of the mass deviation function (for frac- 
tals) or of projections of the elements of the TOM (networks). In both cases 
we observed a significant decrease of hierarchical modularity with increasing 
randomness. For networks, we also observed a rapid fall for small values of p 
in the clustering coefficient and in the Fourier peaks, suggesting a crossover 
similar to the phenomenon described in Ref. [12]. 

It has to be emphasized that the applicability of the Fourier analysis is based 
on the fact that the regular, hierarchical structures had a regular scale sepa- 
ration. If the separation of scales is less regular, hierarchical modularity can 
still exist, however, sufficient randomization would destroy it in this case too. 
In many real networks signatures of hierarchical modularity could be found 
[3] and the same is true for some model networks like the Holme-Kim [13,14] 
network [15]. This indicates that the level of irregularity in these networks is 
far from that reached by randomization in our models. 



5 ACKNOWLEDGEMENTS 

Thanks are due to T. Vicsek, E. Ravasz and A.-L. Barabasi for important 
discussions. 



References 

[1] R. Albert, A-L. Barabasi: Statistical Mechanics of Complex Networks, Rev. 
Mod. Phys. 74, pp. 47-97 (2002) 

[2] M.E.J. Newman: The structure and function of complex networks, SIAM 
Review 45, pp. 167-256 (2003) 

[3] E. Ravasz, A-L. Barabasi: Hierarchical organization in complex networks, Phys. 
Rev. E 67, 026112 (2003) 

[4] A-L. Barabasi, Z. Dezso, E. Ravasz, S-H. Yook and Z. Oltvai: Scale-free and 
hierarchical structures in complex networks, Sitges Proceedings on Complex 
Networks (2002) 

[5] E. Ravasz, et al.: Hierarhical Organization of Modularity in Metabolic Networks, 
Science 297, pp. 1551-1555 (2002) 

[6] Z.N. Oltvai, A-L. Barabasi: Life's Complexity Pyramid, Science 298, pp. 763- 
764 (2002) 



15 



[7] A. Clauset, M.E.J. Newman, and C. Moore: Finding community structure in 
very large networks, Phys. Rev. E 70, 066111 (2004), 

G. Palla, I. Derenyi, I. Farkas, and T. Vicsek, on community identification by 
clique percolation (to appear in Nature) 

[8] M.B. Eisen, P.T. Spellman, P.O. Brown and D. Botstein: Cluster analysis and 
display of genome-wide expression patterns, Proc. Natl. Acad. Sci. USA 95, pp. 
14863-14868 (1998) 

[9] S.M. Dorogovtsev, J. F. F. Mendes: Evolution of networks, Adv. Phys. 51, pp. 
1079-1187 (2002) 

[10] T. Vicsek: Fractal Models for Diffusion Controlled Aggregation, J. Phys. A16, 
L647-651 (1983) 

[11] T. Tel, A. Fulop, T. Vicsek: Determination of fractal dimensions for geometrical 
multifractals, Physica A 159, pp. 155-166 (1989) 

[12] E. Oh, K. Rho, H. Hong, B. Kahng: Modular Synchronization in Complex 
Networks; |arXiv:cond-mat/040820"2l (2004) 

[13] P. Holme B.J. Kim, Phys. Rev. E 65, 026107 (2002) 

[14] G. Szabo, M. Alava, J. Kertesz: Structural Transitions in Scale-free Networks; 
Phys. Rev. E 67, 056102 (2003) 

[15] E. Ravasz, private communication 



16 



